
Increase initialCapacity for HashSet in ExtractDependencies.scala #18219

Closed · wants to merge 2 commits

Conversation

@lolgab (Contributor) commented Jul 16, 2023

Growing that HashSet seems to take 1.27% of the allocations when compiling dotty itself.

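A minimal sketch of where those allocations come from (my illustration, not part of the thread, assuming Scala 2.13's mutable.HashSet with its default initial capacity of 16 and load factor of 0.75): a default-sized set reallocates its backing table several times on the way up, while a pre-sized one pays for its table once.

  import scala.collection.mutable

  object GrowthDemo {
    def main(args: Array[String]): Unit = {
      // Default construction: the table starts at 16 slots and doubles once
      // the element count crosses tableLength * loadFactor, so filling it
      // with 90 elements reallocates the backing array three times.
      val grown = new mutable.HashSet[Int]
      (1 to 90).foreach(grown += _)

      // Pre-sized construction, as in this PR: a single 128-slot table is
      // allocated up front and does not regrow while the size stays at or
      // below roughly 96 (128 * 0.75).
      val presized = new mutable.HashSet[Int](
        initialCapacity = 128,
        loadFactor = mutable.HashSet.defaultLoadFactor)
      (1 to 90).foreach(presized += _)
    }
  }
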
@lolgab (Contributor, Author) commented Jul 16, 2023

test performance please

@nicolasstucki (Contributor) commented:
test performance please

@dottybot (Member) commented:

performance test scheduled: 239 job(s) in queue, 1 running.

@@ -447,7 +447,7 @@ private class ExtractDependenciesCollector extends tpd.TreeTraverser { thisTreeT
     // Avoid cycles by remembering both the types (testcase:
     // tests/run/enum-values.scala) and the symbols of named types (testcase:
     // tests/pos-java-interop/i13575) we've seen before.
-    val seen = new mutable.HashSet[Symbol | Type]
+    val seen = new mutable.HashSet[Symbol | Type](initialCapacity = 128, loadFactor = mutable.HashSet.defaultLoadFactor)
@nicolasstucki (Contributor) commented Jul 17, 2023

128 seems a bit excessive. If we look at the histogram of maximum sizes, we see that 64 is enough to cover most cases (99%). Even with 32 we would cover 87% of cases and pay one extra allocation for the remaining 13% (the larger cases).

[Histogram screenshot (Screenshot 2023-07-17 at 10 08 39): distribution of maximum `seen` set sizes]

I measured it when running scala3-bootstrapped/compile after a clean.

@lolgab (Contributor, Author) replied:

@nicolasstucki Nice histogram! I wanted to build one too but gave up eventually. How did you do it?
Do your percentages consider the loadFactor of 75%? Should I update the initialCapacity to 64?

@nicolasstucki (Contributor) replied:

  def addTypeDependency(tpe: Type)(using Context): Unit = {
    val traverser = new TypeDependencyTraverser {
      def addDependency(symbol: Symbol) = addMemberRefDependency(symbol)
    }
    traverser.traverse(tpe)
    println("seen: " + traverser.seen.size) // instrumentation: log the final size of the `seen` set
  }

  def addPatMatDependency(tpe: Type)(using Context): Unit = {
    val traverser = new TypeDependencyTraverser {
      def addDependency(symbol: Symbol) =
        if (!ignoreDependency(symbol) && symbol.is(Sealed)) {
          val usedName = symbol.zincMangledName
          addUsedName(usedName, UseScope.PatMatTarget)
        }
    }
    traverser.traverse(tpe)
    println("seen: " + traverser.seen.size) // instrumentation: log the final size of the `seen` set
  }

sbt "clean; scala3-bootstrapped/compile" > sbtOutput.txt then filter out seen: from that file and import the numbers in google sheets.

@nicolasstucki (Contributor) replied:

I did not consider the load factor. My percentages only take into account the size of the set. I would go for 64 if we want to minimize those allocations.
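One detail worth spelling out (my note, assuming Scala 2.13 semantics, where the table grows once the element count reaches roughly tableLength * loadFactor): the usable capacity before the first resize is lower than initialCapacity.

  import scala.collection.mutable

  object ThresholdDemo {
    def main(args: Array[String]): Unit = {
      // Approximate resize thresholds under the default load factor (0.75).
      val loadFactor = mutable.HashSet.defaultLoadFactor
      for (capacity <- Seq(32, 64, 128))
        println(s"initialCapacity $capacity holds ~${(capacity * loadFactor).toInt} elements before resizing")
      // initialCapacity 32 holds ~24 elements before resizing
      // initialCapacity 64 holds ~48 elements before resizing
      // initialCapacity 128 holds ~96 elements before resizing
    }
  }

Under that reading, initialCapacity = 64 still pays one resize for sets in the 49-64 range, so the 99% figure above is about set sizes rather than resize-free coverage; covering every size up to 64 without regrowth would take the originally proposed 128.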

64 is enough to cover almost 99% of the cases, so it's a good trade-off between memory consumption and allocation count.
@lolgab marked this pull request as ready for review July 17, 2023 09:04
@nicolasstucki (Contributor) commented:
test performance please

@dottybot (Member) commented:

performance test scheduled: 239 job(s) in queue, 1 running.

@dottybot (Member) commented:

Performance test finished successfully:

Visit https://dotty-bench.epfl.ch/18219/ to see the changes.

Benchmarks are based on merging with main (2753a17).

@nicolasstucki (Contributor) commented:

There are 3 months of non-benchmarked commits (#18221 (comment)). The performance regression could have happened before this change.

@mbovel (Member) commented Jul 17, 2023

There are 3 months of non-benchmarked commits

These are being benchmarked. It should take less than two weeks. I suggest you wait for the missing info before checking whether this PR is responsible for the regression.

@bishabosha (Member) commented Jul 19, 2023

Here is my before-and-after comparison of CPU time (on a warmed-up scala3-compiler-bootstrapped):

before:
[Profiler screenshot (Screenshot 2023-07-19 at 16 21 42)]

after:
[Profiler screenshot (Screenshot 2023-07-19 at 16 22 06)]

So traversal now takes 47% of the phase's CPU time instead of 52% (the bottleneck is now more memory-bound: allocating Node objects in the hash set).

I also noticed on the side that recordDependency spends a lot of time interacting with the filesystem since the Tasty classpath changes.

@lolgab (Contributor, Author) commented Aug 12, 2023

Can you trigger another performance test now?

@bishabosha (Member) commented:

I'm also working on a more thorough optimisation for this phase, but sure, let's do the bench.

@bishabosha (Member) commented:

test performance with #sbt please

@dottybot (Member) commented:

performance test scheduled: 3 job(s) in queue, 1 running.

@dottybot (Member) commented:

Performance test finished successfully:

Visit https://dotty-bench.epfl.ch/18219/ to see the changes.

Benchmarks are based on merging with main (ca6a80e).

bishabosha added a commit that referenced this pull request Sep 22, 2023
alternative to #18219

reduces allocations in the phase by 77% (on `scala3-compiler-bootstrapped`); the phase's contribution to allocations across the whole compilation pipeline drops from about 12% to 4%.

Reduced allocations come from:
- reusing a single set for the `seen` cache.
- using linear-probe maps (dotty.tools.dotc.util), avoiding the allocation of nodes.
- no `ClassDependency` allocation each time a dependency exists (many are likely duplicated).

Performance is also improved by less expensive hashing (`EqHashMap`) and the introduction of `EqHashSet`, which also reduce the number of times we hash (the `add` method on HashSet, and an optimised `getOrElseUpdate`), and probably also by the reduced heap pressure from allocating less.

Now the majority of allocation for this phase comes from calling into Zinc.
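To make the first two bullets concrete, here is a minimal hypothetical sketch (not the actual #18403 code); java.util.IdentityHashMap stands in for dotty's EqHashSet, since both compare keys by reference and keep entries in a flat linear-probe array instead of allocating a Node per element:

  import java.util.IdentityHashMap

  // Hypothetical reusable `seen` cache: one instance lives for the whole
  // phase and is cleared between traversals, instead of allocating a new
  // mutable.HashSet on every call.
  final class SeenCache {
    private val seen = new IdentityHashMap[AnyRef, AnyRef]()

    // True the first time `x` is seen; put() both tests and inserts, so
    // the key is hashed only once (the same motivation as the optimised
    // getOrElseUpdate mentioned above).
    def tryAdd(x: AnyRef): Boolean = seen.put(x, x) == null

    // clear() nulls out the existing table rather than reallocating it,
    // so the capacity reached in earlier traversals is retained.
    def reset(): Unit = seen.clear()
  }

  // Usage sketch:
  //   val cache = new SeenCache
  //   if (cache.tryAdd(tpe)) { /* record dependency, recurse */ }
  //   cache.reset() // before the next traversal
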
@bishabosha (Member) commented:

closing now that #18403 is merged

@bishabosha closed this Oct 11, 2023
@lolgab deleted the bigger-table-hash-set branch October 11, 2023 12:57